DETAILED ACTION 

Information Disclosure Statement 

1. The Examiner has considered the references listed in the Information Disclosure 
Statement dated 9/14/05. A copy of the Information Disclosure Statement is attached 
to this office action. 



Claim Rejections - 35 USC §112 

The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

2. Regarding claims 40-58, claim 40 recites the limitation "wherein the voicing 
information includes" in the first line of the claim. There is insufficient antecedent basis 
for this limitation in the claim. 

Regarding claims 59-77, claim 59 recites the limitation "wherein the voicing 
information includes" in the first line of the claim. There is insufficient antecedent basis 
for this limitation in the claim. 

Claims 66 and 67 are rejected under 35 U.S.C. 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. As written, claim 66 depends upon claim 67 and 
claim 67 depends upon claim 66. 

The following rejections are given using reasonable interpretation of the claim 
language. 



Application/Control Number: 10/046,666 Page 3 

Art Unit: 2654 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 1-6, 16, 27, 28, 37-42, 44, 59, 60, 62 and 63 are rejected under 35 U.S.C. 
103(a) as being unpatentable over Griffin et al. (U.S. Patent 5,701,390), hereinafter 
referred to as Griffin, in view of Barnwell et al. ("Speech Coding: A Computer Laboratory 
Textbook," 1996, John Wiley & Sons, Inc.), hereinafter referred to as Barnwell. 

Regarding claim 1, Griffin discloses a method for the synthesis of MBE-based 
coded speech using regenerated phase information. Griffin's method includes the 
following: 

• dividing the speech model parameters into frames, wherein a frame of speech 
model parameters includes pitch information, voicing information determining the 
voicing state in one or more frequency regions, and spectral information (col. 3, 
lines 4-12; col. 9, lines 28-35); 

• computing a first digital filter using a first frame of speech model parameters, 
wherein the frequency response of the first digital filter corresponds to the spectral 
information in frequency regions where the voicing state equals the selected voicing 
state (Fig. 2, col. 4, lines 38-65; digital filters are used to synthesize the speech, 
excited by the appropriate input [v/uv]; and col. 13, line 60 through col. 14, line 7); 
• computing a second digital filter using a second frame of speech model 
parameters, wherein the frequency response of the second digital filter corresponds to 
the spectral information in frequency regions where the voicing state equals the 
selected voicing state (Fig. 2, col. 4, lines 38-65; parameters from sequential packets 
are loaded creating different filters, which are excited according to voicing state; and 
col. 13, line 60 through col. 14, line 7, sequential packets can overlap). 

But Griffin does not specifically teach the following: 

• determining a set of pulse locations; 

• producing a set of first signal samples from the first digital filter and the pulse 
locations; 

• producing a set of second signal samples from the second digital filter and the 
pulse locations; 

• combining the first signal samples with the second signal samples to produce a 
set of digital speech samples corresponding to the selected voicing state. 

However, the examiner contends that this concept was well known in the art, as 
taught by Barnwell. 

In the same field of endeavor, Barnwell teaches speech coding where a filter is 
"programmed" with coefficients and excited with pulses (pp. 85-89, Fig. 5.2), where the 
pulses will necessarily have a separation (pitch period — location) and sequential sets of 
samples (from frames or subframes) will produce a signal. 
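
For illustration only, the pulse-excited filter arrangement described above can be sketched as follows. This is a minimal sketch and is not taken from Barnwell or Griffin: the function names, the single filter coefficient, and the pitch period of 4 samples are hypothetical values chosen so the arithmetic is easy to follow.

```python
def pulse_train(pitch_period, n_samples):
    """Unit pulses whose separation equals the pitch period (in samples)."""
    return [1.0 if n % pitch_period == 0 else 0.0 for n in range(n_samples)]


def all_pole_filter(excitation, coeffs):
    """Recursive filter: y[n] = x[n] - sum_k coeffs[k] * y[n-1-k]."""
    out = []
    for n, x in enumerate(excitation):
        acc = x
        for k, a in enumerate(coeffs):
            if n - 1 - k >= 0:
                acc -= a * out[n - 1 - k]
        out.append(acc)
    return out


# Hypothetical values: one coefficient, pulses every 4 samples.
excitation = pulse_train(pitch_period=4, n_samples=8)
speech = all_pole_filter(excitation, coeffs=[-0.5])
```

Each pulse restarts a decaying response from the filter, so the spacing of the pulses sets the pitch of the output while the coefficients set its spectral shape.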

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin by specifically providing the features 
taught by Barnwell, because these features were well known in the art at the time of the 
invention for the purpose of producing synthesized speech at a decoder using low 
bandwidth transmissions (Barnwell, p. 85, Introduction). 

Regarding claim 2, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1). In addition, Griffin teaches "wherein the frequency 
response of the first digital filter and the frequency response of the second digital filter 
are zero in frequency regions where the voicing state does not equal the selected 
voicing state" (col. 13, line 62 through col. 14, line 6). 
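
For illustration only, the quoted limitation can be read as a masking operation over frequency regions. The sketch below is a hypothetical rendering, not code from Griffin: the function name, the string labels for the voicing states, and the sample magnitudes are all assumptions.

```python
def masked_response(magnitudes, voicing_states, selected):
    """Keep the spectral magnitude where the region's voicing state equals
    the selected state; force the response to zero everywhere else."""
    return [m if v == selected else 0.0
            for m, v in zip(magnitudes, voicing_states)]


# Hypothetical frame with three frequency regions.
response = masked_response(
    [1.0, 2.0, 3.0],
    ["voiced", "unvoiced", "voiced"],
    selected="voiced",
)
```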

Regarding claim 3, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the spectral 
information includes a set of spectral magnitudes representing the speech spectrum at 
integer multiples of a fundamental frequency" (col. 4, lines 55-61). 

Regarding claim 4, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the speech model 
parameters are generated by decoding a bit stream formed by a speech encoder" (col. 
9, lines 26-30). 

Regarding claim 5, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). In addition, Griffin teaches "wherein the voicing 
information determines which frequency regions are voiced and which frequency 
regions are unvoiced" (col. 13, line 60 through col. 14, line 5). 

Regarding claim 6, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 5). In addition, Griffin in view of Barnwell (see rejection of 
claim 1) teaches "wherein the selected voicing state is the voiced voicing state and the 
pulse locations are computed such that the time between successive pulse locations is 
determined at least in part from the pitch information" (in particular, Barnwell, Fig. 5.2, 
the pitch period determines the space between the excitation pulses). 

Regarding claim 16, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 2). Barnwell teaches "wherein the selected voicing state is a 
pulsed voicing state" (p. 88, Fig. 5.2, voiced excitation can be generated by a pulse 
generator in support of low bandwidth transmission, see claim 1 rejection). 

Regarding claim 27, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1). In addition, Griffin teaches "wherein the spectral 
information includes a set of spectral magnitudes representing the speech spectrum at 
integer multiples of a fundamental frequency" (col. 4, lines 55-60). 

Regarding claim 28, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1). In addition, Barnwell teaches "wherein the speech model 
parameters are generated by decoding a bit stream formed by a speech encoder" (col. 
3, lines 4-22). 

Regarding claim 37, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 1). In addition, Griffin teaches "wherein the digital speech 
samples corresponding to the selected voicing state are further combined with other 
digital speech samples corresponding to other voicing states" (Fig. 2, col. 13, line 62 
through col. 14, line 7). 

Regarding claim 38, this claim has limitations similar to claim 1 and is rejected 
for the same reasons. 

Regarding claim 39, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 38). In addition, Griffin teaches "wherein the digital speech 
samples for the subframe corresponding to the selected voicing state are further 
combined with digital speech samples for the subframe representing other voicing 
states" (Fig. 2, col. 13, line 62 through col. 14, line 7). 

Regarding claim 40, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 39). In addition, Griffin teaches "wherein the voicing 
information includes one or more voicing decisions, with each voicing decision 
determining the voicing state of a frequency region in the subframe" (col. 13, line 62 
through col. 14, line 7). 

Regarding claim 41, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 40). In addition, Griffin teaches "wherein each voicing 
decision determines whether a frequency region in the subframe is voiced or unvoiced" 
(col. 13, line 62 through col. 14, line 7). 

Regarding claim 43, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 41). In addition, Barnwell teaches "wherein each voicing 
decision further determines whether a frequency region in the subframe is pulsed" (Fig. 
5.2, voicing selects the pulse generator, whose output generates the appropriate 
frequency response when passed through the filter). 

Regarding claim 44, Griffin in view of Barnwell teaches everything claimed, as 
applied above (see claim 41). In addition, Griffin in view of Barnwell teaches "wherein the 
selected voicing state is the voiced voicing state and the pulse locations depend at 
least in part on the decoded pitch information for the subframe" (Griffin, Fig. 2, 
decodes information resulting in Vr going to "voicing band determination" module; 
Barnwell, Fig. 5.2, pitch and voicing information go to pulse generator). 

Regarding claim 59, this claim has limitations similar to claim 40 and is rejected 
for the same reasons. 

Regarding claim 60, this claim has limitations similar to claim 41 and is rejected 
for the same reasons. 

Regarding claim 62, this claim has limitations similar to claim 43 and is rejected 
for the same reasons. 

Regarding claim 63, this claim has limitations similar to claim 44 and is rejected 
for the same reasons. 

4. Claims 7, 42, 45, 46, 49, 61, 64, 65 and 68 are rejected under 35 U.S.C. 103(a) 
as being unpatentable over Griffin in view of Barnwell and further in view of well known 
prior art (MPEP 2144.03). 

Regarding claims 7, 42, 45, 61 and 64, Griffin in view of Barnwell teaches 
everything claimed, as applied above (see claims 6, 41, 60, 63, respectively), but Griffin 
in view of Barnwell does not specifically teach "the pulse locations are reinitialized if 
consecutive frames or subframes are predominately not voiced, and future determined 
pulse locations do not substantially depend on speech model parameters corresponding 
to frames or subframes prior to such reinitialization." However, the examiner takes 
official notice of the fact that reinitialization after a period of non-pulsed operation was 
well known in the art. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin in view of Barnwell, because voiced 
operation is more accurate if the pulses are synchronized to the beginning of a voiced 
segment. 

Regarding claims 46, 49, 65 and 68, Griffin in view of Barnwell teaches 
everything claimed, as applied above (see claims 45, 43, 63, and 62, respectively), but 
Griffin in view of Barnwell does not specifically teach "the frequency responses of the 
first impulse response and the second impulse response correspond to the decoded 
spectral information in voiced frequency regions and the frequency responses are 
approximately zero in other frequency regions." However, the examiner takes official 
notice of the fact that a pulsed excitation will generate a frequency response and that 
the non-voiced segments will typically have a much lower energy response. 

Therefore, it would have been obvious to one having ordinary skill in the art at 
the time the invention was made to modify Griffin in view of Barnwell, because voiced 
operation is more accurate if the pulses are synchronized to the beginning of a voiced 
segment. 

Allowable Subject Matter 

5. Claims 8-15, 17-26, 29-36, 47, 48, 50-58, 66, 67 and 69-77 are objected to as 
being dependent upon a rejected base claim, but would be allowable if rewritten in 
independent form including all of the limitations of the base claim and any intervening 
claims. Note that the 35 U.S.C. 112, second paragraph, rejections of claims 47, 48, 
50-58, 66, 67 and 69-77 take precedence. 

Regarding claims 10, 19, 31, 50 and 69, Griffin discloses the synthesis of 
MBE-based coded speech, but Griffin does not teach determining FFT coefficients from the 
decoded model parameters for the first frame in frequency regions where the voicing 
state equals the selected voicing state; processing the FFT coefficients with an inverse 
FFT to compute first time-scaled signal samples; interpolating and resampling the first 
time-scaled signal samples to produce first time-corrected signal samples; and 
multiplying the first time-corrected signal samples by a window function to produce the 
first digital filter. Thus the cited prior art, alone or in combination, does not fairly suggest 
or disclose the claimed combination of features. 
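
For illustration only, the sequence of operations recited above (inverse FFT, interpolation and resampling, then windowing) can be sketched as follows. This is a minimal sketch under stated assumptions and does not reproduce the applicant's implementation: a direct inverse DFT stands in for the inverse FFT, the interpolation is linear and circular, the window is a Hann window, and all function names and the input coefficients are hypothetical.

```python
import cmath
import math


def inverse_dft(coeffs):
    """Direct inverse DFT (stand-in for an inverse FFT); returns real parts."""
    n = len(coeffs)
    return [
        sum(c * cmath.exp(2j * math.pi * k * t / n)
            for k, c in enumerate(coeffs)).real / n
        for t in range(n)
    ]


def resample_linear(samples, n_out):
    """Linearly interpolate one period of samples onto n_out points."""
    n_in = len(samples)
    out = []
    for i in range(n_out):
        pos = i * n_in / n_out
        lo = int(pos)
        frac = pos - lo
        hi = (lo + 1) % n_in  # wrap around: the period is treated as circular
        out.append((1 - frac) * samples[lo] + frac * samples[hi])
    return out


def hann(n):
    """Hann window of length n (assumed window shape)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def filter_from_coeffs(coeffs, n_out):
    time_scaled = inverse_dft(coeffs)                      # time-scaled samples
    time_corrected = resample_linear(time_scaled, n_out)   # time-corrected samples
    return [s * w for s, w in zip(time_corrected, hann(n_out))]


# Hypothetical input: only the DC coefficient is non-zero.
first_filter = filter_from_coeffs([4.0, 0.0, 0.0, 0.0], n_out=8)
```

In this sketch the resampling step is what changes the length of the response, and the window then tapers it to zero at both ends.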

Regarding claims 8, 17 and 29, Griffin discloses a method for reconstructing the 
spectral envelope and voicing information for each of a plurality of frames, but Griffin 
does not teach that the first digital filter is computed as the product of a periodic signal 
and a pitch-dependent window signal, and the period of the periodic signal is 
determined from the pitch information for the first frame. Thus the cited prior art, alone 
or in combination, does not fairly suggest or disclose the claimed combination of 
features. 
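
For illustration only, a filter formed as the product of a periodic signal and a pitch-dependent window might look like the sketch below. The specifics are assumptions and are not drawn from the application or from Griffin: the periodic signal is taken to be a sum of cosines at harmonics of the fundamental, the window is triangular and spans two pitch periods, and the names and values are hypothetical.

```python
import math


def pitch_window(pitch_period):
    """Triangular window spanning two pitch periods, peaking at the centre."""
    return [1.0 - abs(t - pitch_period) / pitch_period
            for t in range(2 * pitch_period + 1)]


def periodic_signal(magnitudes, pitch_period, t):
    """Sum of harmonics; repeats every pitch_period samples."""
    return sum(m * math.cos(2 * math.pi * (h + 1) * t / pitch_period)
               for h, m in enumerate(magnitudes))


def windowed_filter(magnitudes, pitch_period):
    """Product of the periodic signal (centred on the window) and the window."""
    window = pitch_window(pitch_period)
    return [w * periodic_signal(magnitudes, pitch_period, t - pitch_period)
            for t, w in enumerate(window)]


# Hypothetical frame: one harmonic, pitch period of 4 samples.
first_filter = windowed_filter([1.0], pitch_period=4)
```

Both the period of the signal and the length of the window follow from the same pitch value, which is the dependency the quoted limitation describes.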

Regarding claims 25, 47 and 66, Griffin discloses a method for synthesizing 
speech that includes the use of sinusoidal oscillators determined in part from 
the fundamental frequency, but Griffin does not teach that the pulse location 
corresponds to a time offset associated with an impulse in an impulse sequence, the 
first signal samples are computed by convolving the first digital filter with the impulse 
sequence, and the second signal samples are computed by convolving the second 
digital filter with the impulse sequence. 
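
For illustration only, the convolution of each digital filter with a common impulse sequence can be sketched as follows. This is a minimal sketch, not the applicant's method: the filter taps, the pulse locations, and the linear cross-fade used to combine the two sets of samples are assumptions, and the function names are hypothetical.

```python
def impulse_sequence(pulse_locations, n_samples):
    """Unit impulses placed at the given time offsets (pulse locations)."""
    seq = [0.0] * n_samples
    for loc in pulse_locations:
        if 0 <= loc < n_samples:
            seq[loc] = 1.0
    return seq


def convolve(filt, seq):
    """Direct linear convolution, truncated to the length of seq."""
    out = [0.0] * len(seq)
    for n, x in enumerate(seq):
        if x == 0.0:
            continue
        for k, h in enumerate(filt):
            if n + k < len(seq):
                out[n + k] += x * h
    return out


def crossfade(first, second):
    """Assumed combination rule: fade linearly from first to second."""
    n = len(first)
    return [((n - 1 - i) * a + i * b) / (n - 1)
            for i, (a, b) in enumerate(zip(first, second))]


# Hypothetical pulse locations and two-tap filters.
seq = impulse_sequence([0, 4], n_samples=8)
first = convolve([1.0, 0.5], seq)    # first signal samples
second = convolve([2.0, 1.0], seq)   # second signal samples
speech = crossfade(first, second)
```

Because both filters are convolved with the same impulse sequence, the two sets of samples stay aligned in time before they are combined.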

Citation of Pertinent Art 

6. The following prior art made of record but not relied upon is considered pertinent 
to the applicant's disclosure: 

• George et al. (U.S. Patent 5,327,518) discloses an audio analysis/synthesis 
system. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to V. Paul Harper whose telephone number is 
(571) 272-7605. The examiner can normally be reached on M-F. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil, can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 
571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



V. Paul Harper 
Patent Examiner 
Art Unit 2654 



9/15/2005 




